Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 9023 |
| Missing cells | 2527 |
| Missing cells (%) | 1.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.0 MiB |
| Average record size in memory | 345.8 B |
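The derived figures in this table can be cross-checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that 1 MiB is 2^20 bytes; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 16
n_observations = 9023
missing_cells = 2527
avg_record_bytes = 345.8

# Assumption: the percentage is missing cells divided by all cells.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size is average record size times the number of records.
total_mib = avg_record_bytes * n_observations / 2**20

print(f"cells={total_cells}, missing={missing_pct:.1f}%, size={total_mib:.1f} MiB")
```

Both results round to the values reported in the table (1.8% and 3.0 MiB).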
Variable types
| NUM | 10 |
|---|---|
| CAT | 6 |
Reproduction
| Analysis started | 2020-03-01 12:52:23.453889 |
|---|---|
| Analysis finished | 2020-03-01 12:53:29.419453 |
| Version | pandas-profiling v2.5.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Warnings
| Warning | Type |
|---|---|
| name has a high cardinality: 8869 distinct values | High cardinality |
| host_name has a high cardinality: 2423 distinct values | High cardinality |
| neighbourhood has a high cardinality: 89 distinct values | High cardinality |
| last_review has a high cardinality: 987 distinct values | High cardinality |
| neighbourhood is highly correlated with neighbourhood_group | High Correlation |
| neighbourhood_group is highly correlated with neighbourhood | High Correlation |
| last_review has 1261 (14.0%) missing values | Missing |
| reviews_per_month has 1261 (14.0%) missing values | Missing |
| number_of_reviews has 1261 (14.0%) zeros | Zeros |
| availability_365 has 2054 (22.8%) zeros | Zeros |
id
Real number (ℝ≥0)
| Distinct count | 9023 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21626150.66164247 |
|---|---|
| Minimum | 2318 |
| Maximum | 40261634 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.6 KiB |
Quantile statistics
| Minimum | 2318 |
|---|---|
| 5-th percentile | 2516840.9 |
| Q1 | 13205937.5 |
| median | 21804258 |
| Q3 | 31882021 |
| 95-th percentile | 38502364.6 |
| Maximum | 40261634 |
| Range | 40259316 |
| Interquartile range (IQR) | 18676083.5 |
Descriptive statistics
| Standard deviation | 11201265.03 |
|---|---|
| Coefficient of variation (CV) | 0.5179500135 |
| Kurtosis | -1.033035632 |
| Mean | 21626150.66 |
| Mean absolute deviation (MAD) | 9403002.647 |
| Skewness | -0.1574237174 |
| Sum | 1.951327574e+11 |
| Variance | 1.254683382e+14 |
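Several entries in the tables directly above are derived from the others. As a rough check, the sketch below recomputes the range, interquartile range, coefficient of variation, and variance from the reported minimum, maximum, quartiles, mean, and standard deviation. The variable names are illustrative and the inputs are the rounded values shown in the report, so small rounding differences are expected.

```python
# Values copied from the quantile and descriptive statistics tables above.
minimum, maximum = 2318, 40261634
q1, q3 = 13205937.5, 31882021
mean, std = 21626150.66, 11201265.03

value_range = maximum - minimum   # Range = Maximum - Minimum
iqr = q3 - q1                     # Interquartile range = Q3 - Q1
cv = std / mean                   # Coefficient of variation = std / mean
variance = std ** 2               # Variance = std squared

print(value_range, iqr, round(cv, 6), f"{variance:.6e}")
```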
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2.31800000e+03 2.87535000e+04 1.52047500e+06 1.52056500e+06 6.36137900e+06 ... 3.93906090e+07 3.93996065e+07 3.99489895e+07 3.99500735e+07 4.02616340e+07], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 7178239 | 1 | < 0.1% | |
| 3818746 | 1 | < 0.1% | |
| 7089415 | 1 | < 0.1% | |
| 14011651 | 1 | < 0.1% | |
| 38213555 | 1 | < 0.1% | |
| 22818047 | 1 | < 0.1% | |
| 35120024 | 1 | < 0.1% | |
| 4439293 | 1 | < 0.1% | |
| 18390265 | 1 | < 0.1% | |
| 18291977 | 1 | < 0.1% | |
| Other values (9013) | 9013 | 99.9% |
| Value | Count | Frequency (%) | |
| 2318 | 1 | < 0.1% | |
| 5682 | 1 | < 0.1% | |
| 6606 | 1 | < 0.1% | |
| 9419 | 1 | < 0.1% | |
| 9460 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 40261634 | 1 | < 0.1% | |
| 40197071 | 1 | < 0.1% | |
| 40183377 | 1 | < 0.1% | |
| 40183149 | 1 | < 0.1% | |
| 40176359 | 1 | < 0.1% |
name
Categorical
| Distinct count | 8869 |
|---|---|
| Unique (%) | 98.3% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 35.3 KiB |
| Value | Count | Frequency (%) | |
| SoBe Westlake Apartments 2 Bedroom | 9 | 0.1% | |
| Day 1 \| Summer Intern Housing \| DT Seattle xx | 7 | 0.1% | |
| Downtown Seattle \| Summer Rental \| Corp Housing xx | 7 | 0.1% | |
| DT Seattle \| Summer Rental \| 30day special rate xx | 7 | 0.1% | |
| Urban 1 Bedroom Apartment in SLU | 6 | 0.1% | |
| SoBe Downtown Seattle Apartments | 5 | 0.1% | |
| 1 Bedroom Apartment in SLU | 4 | < 0.1% | |
| Spacious and Cozy Home | 4 | < 0.1% | |
| Corporate HighRise Apartment on Pine T1 | 4 | < 0.1% | |
| Loft Near downtown/Capitol hill | 4 | < 0.1% | |
| Other values (8859) | 8965 | 99.4% |
Length
| Max length | 136 |
|---|---|
| Mean length | 38.42535742 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Other_Letter | 85 | 39.5% | |
| Lowercase_Letter | 27 | 12.6% | |
| Uppercase_Letter | 26 | 12.1% | |
| Other_Symbol | 20 | 9.3% | |
| Other_Punctuation | 15 | 7.0% | |
| Decimal_Number | 10 | 4.7% | |
| Math_Symbol | 6 | 2.8% | |
| Final_Punctuation | 3 | 1.4% | |
| Dash_Punctuation | 3 | 1.4% | |
| Close_Punctuation | 3 | 1.4% | |
| Other values (10) | 17 | 7.9% |
| Value | Count | Frequency (%) | |
| Common | 72 | 33.5% | |
| Han | 70 | 32.6% | |
| Latin | 54 | 25.1% | |
| Hangul | 9 | 4.2% | |
| Devanagari | 8 | 3.7% | |
| Inherited | 2 | 0.9% |
| Value | Count | Frequency (%) | |
| ASCII | 90 | 44.1% | |
| CJK | 70 | 34.3% | |
| Hangul | 9 | 4.4% | |
| Dingbats | 9 | 4.4% | |
| Misc Symbols | 9 | 4.4% | |
| Punctuation | 8 | 3.9% | |
| Devanagari | 8 | 3.9% | |
| VS | 1 | 0.5% |
host_id
Real number (ℝ≥0)
| Distinct count | 5233 |
|---|---|
| Unique (%) | 58.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 65151068.86312756 |
|---|---|
| Minimum | 20 |
| Maximum | 310961317 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.6 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 890201.9 |
| Q1 | 8534462 |
| median | 32307630 |
| Q3 | 89449231.5 |
| 95-th percentile | 253517662.7 |
| Maximum | 310961317 |
| Range | 310961297 |
| Interquartile range (IQR) | 80914769.5 |
Descriptive statistics
| Standard deviation | 77948164.05 |
|---|---|
| Coefficient of variation (CV) | 1.196421876 |
| Kurtosis | 1.080937619 |
| Mean | 65151068.86 |
| Mean absolute deviation (MAD) | 60831791.15 |
| Skewness | 1.457367062 |
| Sum | 5.878580944e+11 |
| Variance | 6.075916279e+15 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2.00000000e+01 3.05815000e+04 3.35985000e+04 7.38320000e+04 7.68775000e+04 ... 2.85003798e+08 2.85305949e+08 2.94072704e+08 2.94665106e+08 3.10961317e+08], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 8534462 | 346 | 3.8% | |
| 48005494 | 237 | 2.6% | |
| 50550045 | 152 | 1.7% | |
| 82961680 | 138 | 1.5% | |
| 229095817 | 117 | 1.3% | |
| 4962900 | 92 | 1.0% | |
| 114353388 | 91 | 1.0% | |
| 1243056 | 58 | 0.6% | |
| 74305 | 58 | 0.6% | |
| 222592495 | 52 | 0.6% | |
| Other values (5223) | 7682 | 85.1% |
| Value | Count | Frequency (%) | |
| 20 | 1 | < 0.1% | |
| 862 | 1 | < 0.1% | |
| 1877 | 1 | < 0.1% | |
| 2536 | 2 | < 0.1% | |
| 4193 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 310961317 | 1 | < 0.1% | |
| 309783739 | 2 | < 0.1% | |
| 309771614 | 1 | < 0.1% | |
| 309387078 | 2 | < 0.1% | |
| 309006739 | 1 | < 0.1% |
host_name
Categorical
| Distinct count | 2423 |
|---|---|
| Unique (%) | 26.9% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 35.3 KiB |
| Value | Count | Frequency (%) | |
| Corp Condos & Apts | 346 | 3.8% | |
| Zeus | 237 | 2.6% | |
| Stay Alfred | 183 | 2.0% | |
| Day 1 | 152 | 1.7% | |
| Addison | 138 | 1.5% | |
| Loftium | 117 | 1.3% | |
| David | 75 | 0.8% | |
| Melissa | 74 | 0.8% | |
| Michael | 69 | 0.8% | |
| Andrew | 68 | 0.8% | |
| Other values (2413) | 7560 | 83.8% |
Length
| Max length | 34 |
|---|---|
| Mean length | 7.163249474 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 28 | 41.2% | |
| Uppercase_Letter | 26 | 38.2% | |
| Other_Punctuation | 6 | 8.8% | |
| Decimal_Number | 2 | 2.9% | |
| Math_Symbol | 2 | 2.9% | |
| Space_Separator | 1 | 1.5% | |
| Dash_Punctuation | 1 | 1.5% | |
| Open_Punctuation | 1 | 1.5% | |
| Close_Punctuation | 1 | 1.5% |
| Value | Count | Frequency (%) | |
| Latin | 54 | 79.4% | |
| Common | 14 | 20.6% |
| Value | Count | Frequency (%) | |
| ASCII | 66 | 100.0% |
neighbourhood_group
Categorical
| Distinct count | 17 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.3 KiB |
| Value | Count | Frequency (%) | |
| Downtown | 1748 | 19.4% | |
| Other neighborhoods | 1636 | 18.1% | |
| Capitol Hill | 916 | 10.2% | |
| Central Area | 782 | 8.7% | |
| Queen Anne | 647 | 7.2% | |
| West Seattle | 483 | 5.4% | |
| Ballard | 454 | 5.0% | |
| Rainier Valley | 432 | 4.8% | |
| Cascade | 419 | 4.6% | |
| Beacon Hill | 323 | 3.6% | |
| Other values (7) | 1183 | 13.1% |
Length
| Max length | 19 |
|---|---|
| Mean length | 11.7837748 |
| Min length | 7 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 20 | 52.6% | |
| Uppercase_Letter | 17 | 44.7% | |
| Space_Separator | 1 | 2.6% |
| Value | Count | Frequency (%) | |
| Latin | 37 | 97.4% | |
| Common | 1 | 2.6% |
| Value | Count | Frequency (%) | |
| ASCII | 38 | 100.0% |
neighbourhood
Categorical
| Distinct count | 89 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.3 KiB |
| Value | Count | Frequency (%) | |
| Broadway | 543 | 6.0% | |
| Belltown | 540 | 6.0% | |
| Central Business District | 395 | 4.4% | |
| Wallingford | 325 | 3.6% | |
| First Hill | 316 | 3.5% | |
| Minor | 291 | 3.2% | |
| Fremont | 269 | 3.0% | |
| University District | 256 | 2.8% | |
| South Lake Union | 248 | 2.7% | |
| Pike-Market | 245 | 2.7% | |
| Other values (79) | 5595 | 62.0% |
Length
| Max length | 25 |
|---|---|
| Mean length | 11.4717943 |
| Min length | 4 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 24 | 49.0% | |
| Uppercase_Letter | 22 | 44.9% | |
| Dash_Punctuation | 1 | 2.0% | |
| Space_Separator | 1 | 2.0% | |
| Other_Punctuation | 1 | 2.0% |
| Value | Count | Frequency (%) | |
| Latin | 46 | 93.9% | |
| Common | 3 | 6.1% |
| Value | Count | Frequency (%) | |
| ASCII | 49 | 100.0% |
latitude
Real number (ℝ≥0)
| Distinct count | 6537 |
|---|---|
| Unique (%) | 72.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.625185596764595 |
|---|---|
| Minimum | 47.49587 |
| Maximum | 47.73395 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.6 KiB |
Quantile statistics
| Minimum | 47.49587 |
|---|---|
| 5-th percentile | 47.540986 |
| Q1 | 47.605535 |
| median | 47.61984 |
| Q3 | 47.659275 |
| 95-th percentile | 47.698086 |
| Maximum | 47.73395 |
| Range | 0.23808 |
| Interquartile range (IQR) | 0.05374 |
Descriptive statistics
| Standard deviation | 0.04554863553 |
|---|---|
| Coefficient of variation (CV) | 0.0009563980688 |
| Kurtosis | -0.1027247102 |
| Mean | 47.6251856 |
| Mean absolute deviation (MAD) | 0.03539203091 |
| Skewness | -0.1868813729 |
| Sum | 429722.0496 |
| Variance | 0.002074678199 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[47.49587 47.50955 47.524 47.55026 47.598045 ... 47.650805 47.678215 47.69673 47.705245 47.73395 ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 47.61108 | 8 | 0.1% | |
| 47.61003 | 7 | 0.1% | |
| 47.61602 | 7 | 0.1% | |
| 47.61183 | 7 | 0.1% | |
| 47.61146 | 7 | 0.1% | |
| 47.61109 | 7 | 0.1% | |
| 47.61243 | 7 | 0.1% | |
| 47.61172 | 6 | 0.1% | |
| 47.6112 | 6 | 0.1% | |
| 47.61369 | 6 | 0.1% | |
| Other values (6527) | 8955 | 99.2% |
| Value | Count | Frequency (%) | |
| 47.49587 | 1 | < 0.1% | |
| 47.49656 | 1 | < 0.1% | |
| 47.49661 | 1 | < 0.1% | |
| 47.49732 | 1 | < 0.1% | |
| 47.49741 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 47.73395 | 1 | < 0.1% | |
| 47.73385 | 1 | < 0.1% | |
| 47.73369 | 1 | < 0.1% | |
| 47.73364 | 1 | < 0.1% | |
| 47.73362 | 1 | < 0.1% |
longitude
Real number (ℝ)
| Distinct count | 6215 |
|---|---|
| Unique (%) | 68.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -122.3337326875762 |
|---|---|
| Minimum | -122.41925 |
| Maximum | -122.24095 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.6 KiB |
Quantile statistics
| Minimum | -122.41925 |
|---|---|
| 5-th percentile | -122.389387 |
| Q1 | -122.353365 |
| median | -122.33301 |
| Q3 | -122.312565 |
| 95-th percentile | -122.283245 |
| Maximum | -122.24095 |
| Range | 0.1783 |
| Interquartile range (IQR) | 0.0408 |
Descriptive statistics
| Standard deviation | 0.0313327359 |
|---|---|
| Coefficient of variation (CV) | -0.0002561250704 |
| Kurtosis | -0.2670155781 |
| Mean | -122.3337327 |
| Mean absolute deviation (MAD) | 0.02504650342 |
| Skewness | -0.1261254543 |
| Sum | -1103817.27 |
| Variance | 0.000981740339 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-122.41925 -122.415155 -122.402825 -122.3905 -122.362995 ... -122.286195 -122.274545 -122.26429 -122.25913 -122.24095 ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| -122.33635 | 7 | 0.1% | |
| -122.33786 | 7 | 0.1% | |
| -122.33917 | 6 | 0.1% | |
| -122.34175 | 6 | 0.1% | |
| -122.32378 | 6 | 0.1% | |
| -122.32793 | 6 | 0.1% | |
| -122.32889 | 6 | 0.1% | |
| -122.34229 | 6 | 0.1% | |
| -122.3493 | 5 | 0.1% | |
| -122.33662 | 5 | 0.1% | |
| Other values (6205) | 8963 | 99.3% |
| Value | Count | Frequency (%) | |
| -122.41925 | 1 | < 0.1% | |
| -122.41908 | 1 | < 0.1% | |
| -122.41839 | 1 | < 0.1% | |
| -122.41791 | 1 | < 0.1% | |
| -122.41784 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| -122.24095 | 1 | < 0.1% | |
| -122.2412 | 1 | < 0.1% | |
| -122.24135 | 1 | < 0.1% | |
| -122.24204 | 1 | < 0.1% | |
| -122.24215 | 1 | < 0.1% |
room_type
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.3 KiB |
| Value | Count | Frequency (%) | |
| Entire home/apt | 6793 | 75.3% | |
| Private room | 1908 | 21.1% | |
| Shared room | 174 | 1.9% | |
| Hotel room | 148 | 1.6% |
Length
| Max length | 15 |
|---|---|
| Mean length | 14.20647235 |
| Min length | 10 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 13 | 68.4% | |
| Uppercase_Letter | 4 | 21.1% | |
| Space_Separator | 1 | 5.3% | |
| Other_Punctuation | 1 | 5.3% |
| Value | Count | Frequency (%) | |
| Latin | 17 | 89.5% | |
| Common | 2 | 10.5% |
| Value | Count | Frequency (%) | |
| ASCII | 19 | 100.0% |
price
Real number (ℝ≥0)
| Distinct count | 400 |
|---|---|
| Unique (%) | 4.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 170.37138424027486 |
|---|---|
| Minimum | 0 |
| Maximum | 9999 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Memory size | 70.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 44 |
| Q1 | 80 |
| median | 119 |
| Q3 | 186.5 |
| 95-th percentile | 450 |
| Maximum | 9999 |
| Range | 9999 |
| Interquartile range (IQR) | 106.5 |
Descriptive statistics
| Standard deviation | 220.6638496 |
|---|---|
| Coefficient of variation (CV) | 1.295193149 |
| Kurtosis | 513.260198 |
| Mean | 170.3713842 |
| Mean absolute deviation (MAD) | 104.3783858 |
| Skewness | 14.89515356 |
| Sum | 1537261 |
| Variance | 48692.53453 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 12.5 29.5 30.5 34.5 ... 765.5 987. 1008.5 1575. 9999. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 100 | 333 | 3.7% | |
| 150 | 326 | 3.6% | |
| 75 | 233 | 2.6% | |
| 125 | 232 | 2.6% | |
| 99 | 216 | 2.4% | |
| 90 | 206 | 2.3% | |
| 120 | 196 | 2.2% | |
| 200 | 192 | 2.1% | |
| 80 | 189 | 2.1% | |
| 85 | 171 | 1.9% | |
| Other values (390) | 6729 | 74.6% |
| Value | Count | Frequency (%) | |
| 0 | 2 | < 0.1% | |
| 10 | 11 | 0.1% | |
| 15 | 26 | 0.3% | |
| 17 | 1 | < 0.1% | |
| 18 | 20 | 0.2% |
| Value | Count | Frequency (%) | |
| 9999 | 1 | < 0.1% | |
| 5400 | 1 | < 0.1% | |
| 5000 | 1 | < 0.1% | |
| 4000 | 1 | < 0.1% | |
| 3000 | 1 | < 0.1% |
minimum_nights
Real number (ℝ≥0)
| Distinct count | 47 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.052643245040453 |
|---|---|
| Minimum | 1 |
| Maximum | 400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 30 |
| Maximum | 400 |
| Range | 399 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 14.73778795 |
|---|---|
| Coefficient of variation (CV) | 2.91684713 |
| Kurtosis | 297.289298 |
| Mean | 5.052643245 |
| Mean absolute deviation (MAD) | 5.635676895 |
| Skewness | 14.14179398 |
| Sum | 45590 |
| Variance | 217.2023935 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 5.5 ... 29.5 30.5 31.5 181. 400. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 2 | 3466 | 38.4% | |
| 1 | 3077 | 34.1% | |
| 3 | 1032 | 11.4% | |
| 30 | 647 | 7.2% | |
| 4 | 235 | 2.6% | |
| 5 | 175 | 1.9% | |
| 7 | 132 | 1.5% | |
| 6 | 48 | 0.5% | |
| 14 | 33 | 0.4% | |
| 10 | 31 | 0.3% | |
| Other values (37) | 147 | 1.6% |
| Value | Count | Frequency (%) | |
| 1 | 3077 | 34.1% | |
| 2 | 3466 | 38.4% | |
| 3 | 1032 | 11.4% | |
| 4 | 235 | 2.6% | |
| 5 | 175 | 1.9% |
| Value | Count | Frequency (%) | |
| 400 | 1 | < 0.1% | |
| 365 | 3 | < 0.1% | |
| 360 | 1 | < 0.1% | |
| 345 | 1 | < 0.1% | |
| 330 | 1 | < 0.1% |
number_of_reviews
Real number (ℝ≥0)
| Distinct count | 408 |
|---|---|
| Unique (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.344231408622406 |
|---|---|
| Minimum | 0 |
| Maximum | 795 |
| Zeros | 1261 |
| Zeros (%) | 14.0% |
| Memory size | 70.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 18 |
| Q3 | 66 |
| 95-th percentile | 206 |
| Maximum | 795 |
| Range | 795 |
| Interquartile range (IQR) | 63 |
Descriptive statistics
| Standard deviation | 75.89981672 |
|---|---|
| Coefficient of variation (CV) | 1.507616952 |
| Kurtosis | 9.928687275 |
| Mean | 50.34423141 |
| Mean absolute deviation (MAD) | 52.63297346 |
| Skewness | 2.673289324 |
| Sum | 454256 |
| Variance | 5760.782179 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000e+00 5.000e-01 1.500e+00 4.500e+00 7.500e+00 ... 2.735e+02 3.135e+02 4.195e+02 5.545e+02 7.950e+02], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0 | 1261 | 14.0% | |
| 1 | 590 | 6.5% | |
| 2 | 344 | 3.8% | |
| 3 | 305 | 3.4% | |
| 4 | 261 | 2.9% | |
| 6 | 207 | 2.3% | |
| 7 | 176 | 2.0% | |
| 5 | 175 | 1.9% | |
| 8 | 155 | 1.7% | |
| 9 | 143 | 1.6% | |
| Other values (398) | 5406 | 59.9% |
| Value | Count | Frequency (%) | |
| 0 | 1261 | 14.0% | |
| 1 | 590 | 6.5% | |
| 2 | 344 | 3.8% | |
| 3 | 305 | 3.4% | |
| 4 | 261 | 2.9% |
| Value | Count | Frequency (%) | |
| 795 | 1 | < 0.1% | |
| 778 | 1 | < 0.1% | |
| 733 | 1 | < 0.1% | |
| 640 | 1 | < 0.1% | |
| 569 | 1 | < 0.1% |
last_review
Categorical
| Distinct count | 987 |
|---|---|
| Unique (%) | 12.7% |
| Missing | 1261 |
| Missing (%) | 14.0% |
| Memory size | 35.3 KiB |
| Value | Count | Frequency (%) | |
| 2019-11-17 | 369 | 4.1% | |
| 2019-11-11 | 331 | 3.7% | |
| 2019-11-03 | 264 | 2.9% | |
| 2019-11-10 | 263 | 2.9% | |
| 2019-10-20 | 207 | 2.3% | |
| 2019-11-04 | 205 | 2.3% | |
| 2019-11-18 | 194 | 2.2% | |
| 2019-10-27 | 161 | 1.8% | |
| 2019-11-16 | 126 | 1.4% | |
| 2019-10-21 | 123 | 1.4% | |
| Other values (977) | 5519 | 61.2% | |
| (Missing) | 1261 | 14.0% |
Length
| Max length | 10 |
|---|---|
| Mean length | 9.021722265 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 10 | 76.9% | |
| Lowercase_Letter | 2 | 15.4% | |
| Dash_Punctuation | 1 | 7.7% |
| Value | Count | Frequency (%) | |
| Common | 11 | 84.6% | |
| Latin | 2 | 15.4% |
| Value | Count | Frequency (%) | |
| ASCII | 13 | 100.0% |
reviews_per_month
Real number (ℝ≥0)
| Distinct count | 897 |
|---|---|
| Unique (%) | 11.6% |
| Missing | 1261 |
| Missing (%) | 14.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.3141084771965983 |
|---|---|
| Minimum | 0.01 |
| Maximum | 14.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.6 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 0.08 |
| Q1 | 0.48 |
| median | 1.6 |
| Q3 | 3.61 |
| 95-th percentile | 6.7995 |
| Maximum | 14.8 |
| Range | 14.79 |
| Interquartile range (IQR) | 3.13 |
Descriptive statistics
| Standard deviation | 2.242515455 |
|---|---|
| Coefficient of variation (CV) | 0.9690623743 |
| Kurtosis | 1.304659226 |
| Mean | 2.314108477 |
| Mean absolute deviation (MAD) | 1.807217491 |
| Skewness | 1.233886034 |
| Sum | 17962.11 |
| Variance | 5.028875568 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 1 | 115 | 1.3% | |
| 0.04 | 76 | 0.8% | |
| 0.06 | 61 | 0.7% | |
| 0.07 | 59 | 0.7% | |
| 0.05 | 59 | 0.7% | |
| 0.09 | 58 | 0.6% | |
| 0.11 | 55 | 0.6% | |
| 0.15 | 54 | 0.6% | |
| 0.02 | 54 | 0.6% | |
| 0.19 | 53 | 0.6% | |
| Other values (887) | 7118 | 78.9% | |
| (Missing) | 1261 | 14.0% |
| Value | Count | Frequency (%) | |
| 0.01 | 6 | 0.1% | |
| 0.02 | 54 | 0.6% | |
| 0.03 | 48 | 0.5% | |
| 0.04 | 76 | 0.8% | |
| 0.05 | 59 | 0.7% |
| Value | Count | Frequency (%) | |
| 14.8 | 1 | < 0.1% | |
| 14.26 | 1 | < 0.1% | |
| 14.19 | 1 | < 0.1% | |
| 14.08 | 1 | < 0.1% | |
| 13.33 | 1 | < 0.1% |
calculated_host_listings_count
Real number (ℝ≥0)
| Distinct count | 37 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.48199046880195 |
|---|---|
| Minimum | 1 |
| Maximum | 346 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 8 |
| 95-th percentile | 237 |
| Maximum | 346 |
| Range | 345 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 78.64723792 |
|---|---|
| Coefficient of variation (CV) | 2.421256727 |
| Kurtosis | 8.073467018 |
| Mean | 32.48199047 |
| Mean absolute deviation (MAD) | 48.23541032 |
| Skewness | 2.967937978 |
| Sum | 293085 |
| Variance | 6185.388032 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 5.5 ... 55. 127.5 145. 291.5 346. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 1 | 4244 | 47.0% | |
| 2 | 1170 | 13.0% | |
| 3 | 576 | 6.4% | |
| 346 | 346 | 3.8% | |
| 237 | 237 | 2.6% | |
| 4 | 232 | 2.6% | |
| 5 | 185 | 2.1% | |
| 6 | 156 | 1.7% | |
| 152 | 152 | 1.7% | |
| 138 | 138 | 1.5% | |
| Other values (27) | 1587 | 17.6% |
| Value | Count | Frequency (%) | |
| 1 | 4244 | 47.0% | |
| 2 | 1170 | 13.0% | |
| 3 | 576 | 6.4% | |
| 4 | 232 | 2.6% | |
| 5 | 185 | 2.1% |
| Value | Count | Frequency (%) | |
| 346 | 346 | 3.8% | |
| 237 | 237 | 2.6% | |
| 152 | 152 | 1.7% | |
| 138 | 138 | 1.5% | |
| 117 | 117 | 1.3% |
availability_365
Real number (ℝ≥0)
| Distinct count | 365 |
|---|---|
| Unique (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 139.20691565998004 |
|---|---|
| Minimum | 0 |
| Maximum | 365 |
| Zeros | 2054 |
| Zeros (%) | 22.8% |
| Memory size | 70.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 7 |
| median | 90 |
| Q3 | 270 |
| 95-th percentile | 362 |
| Maximum | 365 |
| Range | 365 |
| Interquartile range (IQR) | 263 |
Descriptive statistics
| Standard deviation | 133.1438122 |
|---|---|
| Coefficient of variation (CV) | 0.9564453864 |
| Kurtosis | -1.26810398 |
| Mean | 139.2069157 |
| Mean absolute deviation (MAD) | 117.4576913 |
| Skewness | 0.5133156261 |
| Sum | 1256064 |
| Variance | 17727.27474 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 17.5 18.5 40.5 ... 354.5 362.5 363.5 364.5 365. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0 | 2054 | 22.8% | |
| 365 | 238 | 2.6% | |
| 364 | 134 | 1.5% | |
| 90 | 91 | 1.0% | |
| 41 | 80 | 0.9% | |
| 363 | 77 | 0.9% | |
| 180 | 65 | 0.7% | |
| 49 | 64 | 0.7% | |
| 324 | 62 | 0.7% | |
| 18 | 54 | 0.6% | |
| Other values (355) | 6104 | 67.6% |
| Value | Count | Frequency (%) | |
| 0 | 2054 | 22.8% | |
| 1 | 40 | 0.4% | |
| 2 | 29 | 0.3% | |
| 3 | 31 | 0.3% | |
| 4 | 29 | 0.3% |
| Value | Count | Frequency (%) | |
| 365 | 238 | 2.6% | |
| 364 | 134 | 1.5% | |
| 363 | 77 | 0.9% | |
| 362 | 52 | 0.6% | |
| 361 | 39 | 0.4% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear relationship the steepness of the line does not affect r; only its direction does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
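The calculation described above can be sketched in plain Python. This is a minimal illustration on made-up toy data, not the code the report generator uses; the function name `pearson_r` and the sample values are assumptions for the example.

```python
import math

def pearson_r(xs, ys):
    """Covariance of X and Y divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n factors of the covariance and the variances cancel, so sums suffice.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Toy data (hypothetical, not taken from the profiled dataset).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]

r = pearson_r(x, y)
# Shifting either variable, or rescaling it by a positive factor, leaves r unchanged.
r_rescaled = pearson_r([10 * v + 3 for v in x], [0.5 * v - 7 for v in y])
print(round(r, 4), round(r_rescaled, 4))
```

Flipping the sign of one variable flips the sign of r, which is why only the direction of a linear relationship matters, not its steepness.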
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
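As a rough sketch of that definition, the example below ranks each variable (assuming tied values share the average of their positions) and then applies the Pearson formula to the ranks. The helper names and toy data are illustrative assumptions, not part of the report.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n; tied values receive the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / math.sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))

def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables of X and Y."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# Toy data: y = x**3 is monotonic but not linear.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(round(rho, 4), round(r, 4))
```

On this toy data ρ reaches 1 because the relationship is perfectly monotonic, while r stays below 1 because it is not linear.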
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
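The pair-counting definition above can be sketched directly. Note that this literal form (often called tau-a) makes no adjustment for ties, whereas library implementations may use a tie-corrected variant, so results can differ on data with many repeated values. The function name and toy data are assumptions for illustration.

```python
from itertools import combinations

def kendall_tau(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables move in the same direction
        elif sign < 0:
            discordant += 1   # the variables move in opposite directions
        # sign == 0 means a tie in x or y; it counts toward neither group
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Toy data (hypothetical): 7 concordant and 3 discordant pairs out of 10.
x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]

tau = kendall_tau(x, y)
print(tau)
```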
First rows
| | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2318 | Casa Madrona - Urban Oasis 1 block from the park! | 2536 | Megan | Central Area | Madrona | 47.61082 | -122.29082 | Entire home/apt | 296 | 7 | 29 | 2019-10-31 | 0.21 | 2 | 59 |
| 1 | 5682 | Cozy Studio, min. to downtown -WiFi | 8993 | Maddy | Delridge | South Delridge | 47.52398 | -122.35989 | Entire home/apt | 48 | 3 | 462 | 2018-11-24 | 3.92 | 1 | 0 |
| 2 | 6606 | Fab, private seattle urban cottage! | 14942 | Joyce | Other neighborhoods | Wallingford | 47.65411 | -122.33761 | Entire home/apt | 90 | 2 | 150 | 2019-09-28 | 1.19 | 3 | 49 |
| 3 | 9419 | Glorious sun room w/ memory foambed | 30559 | Angielena | Other neighborhoods | Georgetown | 47.55062 | -122.32014 | Private room | 62 | 2 | 146 | 2019-10-22 | 1.29 | 8 | 359 |
| 4 | 9460 | Downtown Convention Center B&B -- Free Minibar | 30832 | Siena | Downtown | First Hill | 47.61265 | -122.32936 | Private room | 99 | 3 | 455 | 2019-11-09 | 3.65 | 4 | 138 |
| 5 | 9531 | The Adorable Sweet Orange Craftsman | 31481 | Cassie | West Seattle | Fairmount Park | 47.55539 | -122.38474 | Entire home/apt | 165 | 3 | 39 | 2019-09-20 | 0.41 | 2 | 336 |
| 6 | 9534 | The Coolest Tangerine Dream MIL! | 31481 | Cassie | West Seattle | Fairmount Park | 47.55624 | -122.38598 | Entire home/apt | 125 | 2 | 46 | 2019-10-28 | 0.48 | 2 | 346 |
| 7 | 9596 | the down home , spacious, central and fab! | 14942 | Joyce | Other neighborhoods | Wallingford | 47.65479 | -122.33652 | Entire home/apt | 120 | 2 | 93 | 2019-09-22 | 0.91 | 3 | 0 |
| 8 | 9909 | Luna Lower - West Seattle | 33360 | Laura | West Seattle | Fairmount Park | 47.56521 | -122.37375 | Entire home/apt | 125 | 3 | 73 | 2019-10-21 | 0.60 | 8 | 347 |
| 9 | 11012 | the orange house, quiet 'n central | 14942 | Joyce | Other neighborhoods | Wallingford | 47.65448 | -122.33646 | Entire home/apt | 299 | 2 | 91 | 2019-09-01 | 0.76 | 3 | 177 |
Last rows
| | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9013 | 40156785 | Enthralling and Comfy 1-BR Apartment in Seattle | 306367598 | Roderick | Downtown | First Hill | 47.60830 | -122.32818 | Entire home/apt | 120 | 1 | 0 | NaN | NaN | 2 | 364 |
| 9014 | 40159265 | Seattle Dreamy and Scenic 1 Bedroom Apartment | 306367598 | Roderick | Downtown | First Hill | 47.60912 | -122.32898 | Entire home/apt | 158 | 1 | 0 | NaN | NaN | 2 | 333 |
| 9015 | 40162643 | Two Bedroom Downtown Oasis (Parking Included) WS97 | 208530431 | Seattle Super Suites | Downtown | First Hill | 47.61108 | -122.32895 | Entire home/apt | 199 | 3 | 0 | NaN | NaN | 2 | 117 |
| 9016 | 40174107 | Cozy Downtown Quarters w/ City Views | 183583319 | Alex | Downtown | Belltown | 47.61554 | -122.34561 | Entire home/apt | 115 | 2 | 0 | NaN | NaN | 4 | 38 |
| 9017 | 40175430 | Clean private Rm in comfortable North Seattle home | 7435040 | Chengying | Northgate | Haller Lake | 47.72271 | -122.33583 | Private room | 40 | 1 | 0 | NaN | NaN | 8 | 234 |
| 9018 | 40176359 | 2 clean private rooms big window in North Seattle | 7435040 | Chengying | Northgate | Haller Lake | 47.72269 | -122.33539 | Private room | 80 | 2 | 0 | NaN | NaN | 8 | 234 |
| 9019 | 40183149 | Laurel's House | 21013086 | Ron Paul | Other neighborhoods | Fremont | 47.65662 | -122.34548 | Entire home/apt | 60 | 30 | 0 | NaN | NaN | 2 | 176 |
| 9020 | 40183377 | Entire House *Walker’s Pradise*Good Transit | 289666185 | Nhat | Rainier Valley | Columbia City | 47.56200 | -122.29087 | Entire home/apt | 89 | 1 | 0 | NaN | NaN | 1 | 356 |
| 9021 | 40197071 | Steps to Pike Place and Gum Wall | 226137890 | Xenia | Downtown | Pike-Market | 47.60866 | -122.33936 | Entire home/apt | 107 | 1 | 0 | NaN | NaN | 11 | 25 |
| 9022 | 40261634 | Seattle home that’s close to everything | 310961317 | Jeffrey | West Seattle | Fairmount Park | 47.54959 | -122.37772 | Entire home/apt | 120 | 1 | 0 | NaN | NaN | 1 | 173 |